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Abstract 

Multi-agent complex systems comprising populations of decision-making particles, have wide 
application across the biological, informational and social sciences. We uncover a formal analogy 
between these systems' time-averaged dynamics and conventional many-body theory in Physics. 
Their behavior is dominated by the formation of ' Crowd- Anticrowd' quasiparticles. For the specific 
example of the Minority Game, our formalism yields analytic expressions which are in excellent 
agreement with numerical simulations. 
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Multi-agent simulations are currently being used to study the dynamical behavior within a 
wide variety of Complex Systems Q]. Within these simulations, N decision- making particles 
or agents (e.g. commuters, traders, computer programs, cancer/normal cells, guerillas ^,0, 
3]) repeatedly compete with each other for some limited global resource (e.g. road space, 
best buy/sell price, processing time, nutrients and physical space, political power) using 
sets of rules which may differ between agents and may change in time. The population 
is therefore competitive, heterogeneous and adaptive. A simple minimal model which has 
generated more than one hundr ed p apers since 1998, is the Minority Game (MG) of Challet 
and Zhang llO]. Numerical simulations can yield fascinating results - 

however it is very hard to develop a general yet analytic theory of such multi-agent systems. 

Given that a multi-agent population is a many-body interacting system with the addi- 
tional complication of the particles being decision-making, one wonders whether it might be 
possible to develop a generalized 'many-body' theory of such systems. This is a daunting 
task since the success of conventional many-body theory relies on the fundamental physical 
particles having a relatively simple internal configuration space (e.g. spin) which is identical 
for each particle - moreover, the particle-particle interactions are time-independent By 
contrast, each decision- making particle lives in a complex configuration space represented 
by the information it receives and the particular strategies which it happens to possess. In 
addition, the agent-agent interactions generally evolve in time and depend on prior history. 
However, it is precisely these difficulties which make this problem so interesting to a theo- 
retical physicist. In addition to the important real- world applications listed above, such a 
generalized many-body theory could be applied to physical systems where internal degrees 
of freedom can be created artificially. An interesting technological example concerns an 
array of interacting or interconnected nanostructures, where each nanostructure has its own 
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active defects which respond to the collective actions of the others 

Here we propose a general many-body- like formalism for these complex A^-body systems. 
Inspired by conventional many-body theory ^l| , it is based around the accurate description 
of the correlations between groups of agents. We show that the system's fiuctuations will 
in general be dominated by the formation of crowds, and in particular the anti-correlation 
between a given crowd and its mirror-image (i.e. 'anticrowd'). The formalism, when applied 
to the MG, yields a set of analytic results which are in excellent agreement with the numerical 



findings of Savit et al. (see Fig. 1) p]. We note that there have been many other MG theories 
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proposed to date |5|, yet none of these has provided an analytic description of the Savit-curve 
^ over the entire parameter space. In what follows, we do not restrict ourselves to the MG 
- for example, our formalism can be easily generalized to multiple options [10']. We hope 
that our results stimulate many-body physicists to investigate transferring their techniques 
to these more general systems and real-world applications. 

Consider agents (e.g. commuters) who repeatedly decide between two actions at each 
timestep t (e.g. +1/ — 1 = take route A/B) using their individual S strategies. Our 
formalism will apply to a wide variety of multi-agent games since it is reasonably insensitive 
to the game's rules concerning strategy-choice, rewards, and the definition of the winning 
group. The agents have access to a common information source fi{t) which they use to 
decide actions. This information may be global or local, correct or wrong, internally or 
externally generated. Each strategy, labelled R, comprises a particular action for each 
fi{t) G {fi(t)}, and the set of possible strategies constitutes a strategy space 6 = {R}- The 
strategy allocation among agents can be described in terms of a rank-S" tensor \l/ [2| where 
each entry gives the number of agents holding a particular combination of S strategies. We 
assume \l/ to be constant over the timescale for which time-averages are taken. A single \l/ 
'macrostate' corresponds to many possible 'microstates' describing the specific partitions of 
strategies among the A^ agents To allow for large strategy spaces and large sets of global 
information, we consider {R} and fi{t) to be numbers on the line from R = 1 -Rmax? and 
from fi = 1 ^ /imax respectively. For small strategy spaces, the subsequent integrals can be 
converted to sums. Denoting the number of agents choosing —1 (+1) as A^_i(t) (A^+i(t)), 
the excess number choosing +1 over —1 represents the inefficiency of the system and is given 
by D(t) = N^i{t) — N_i{t). In the context of financial markets, D(t) would be proportional 
to the price-change representing the excess of demand over supply. Similar analysis can be 
carried out for any function of D{t), N^i{t) and/or N_i{t), and time- cumulative value of 
these quantities. Here we focus on D(t) which is given exactly by: 



where S_(t) is the current score- vector denoting the past performance of each strategy 
The combination of S_{t), \1/ and the game rule (e.g. use strategy with best or second-best 
performance to date) will define the number of agents n^^^'"^ using strategy R at time t. 
The action a'^^^ = ±1 is determined uniquely by n{t). 
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In conventional many-body Physics, we are either interested in the dynamical properties 



of D(t), such as the equation-of-motion, or its statistical properties ll|. Here we focus on 
these statistical properties: (i) the moments of the probability distribution function (PDF) 
of D{t) (e.g. mean, variance, kurtosis) and (ii) the correlation functions that are products 
of D{t) at various different times ti = t, t2 = t + t, = t + t' etc. (e.g. autocorrelation). 
Numerical multi-agent simulations typically average over time t and then over configurations 
{\E'}. A general expression to generate all such functions, is therefore 

Dp^''^"'-'>^{{D{h)D{t2)...D{tp)),)^ (2) 
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where 

V^^\R,, i?2 . . . , i?p; tl, t2 . . . , tp) ^ a^l^^a^l'^ . . . ^l^^ (3) 

resembles a time-dependent, non-translationally invariant, p-body interaction potential in 
R = {Ri, R2 . . . , Rp)-space, between p charge-densities {nj^^'^''^} of like-minded agents. 
Note that each charge-density Up-^"^'^ now possesses internal degrees of freedom determined 
by S_{t) and \1'. Since {^^f'"*'"^} are determined by the game's rules, Eq. (0) can be applied 
to any multi-agent game, not just MG. We focus here on moments of the PDF of D{t) 
where {tj} = t and hence {r} = 0. Discussion of temporal correlation functions such as 

(t) 

the autocorrelation D2 will be reported elsewhere. We consider explicitly the variance D2 
to demonstrate the approach, noting that higher-order moments such as (i.e. kurtosis) 
which classify the non-Gaussianity of the PDF, can be treated in a similar way. The potential 
V^^^ is insensitive to the configuration- average over hence the mean is given by [isj ]: 

D. = l^~m {v^^'(R^,t) {nf>-)^l . (4) 

If the game's output is unbiased, the averages yield Di = 0. This condition is not necessary 
- one can simply subtract Df from the right hand side of the expression for D2 below - 
however we will take Di = for clarity. The variance D2 measures the fluctuations of D(t) 
about its average value: 



where V^^\R,R']t) 

= '^R^^'^R'^^ 3.cts like a time-dependent, non-translationally invariant, 
two-body interaction potential in {R, i?')-space. Figure 2 illustrates a diagrammatic repre- 
sentation of Dp=i 2 in analogy with conventional many-body theory. 

The effective charge-densities and potential will fluctuate in time. It is reasonable to 
assume that the charge densities fluctuate around some mean value, hence n^''^'^ n^f^'^ = 
nRURi if) with mean 7x^71^/ plus a fluctuating term e^j^,'^ (t). This is a good approx- 

imation if we take i? to be a popularity-ranking (i.e. the i?th most popular strategy) or a 
strategy-performance ranking (i.e. the i?th best-performing strategy) since in these cases 
n^^*'*'* will be reasonably constant. For example, taking i? as a popularity-ranking implies 
^H=i* — ^^^=2"^ — ^^^=3* ^ • • thereby constraining the magnitude of the fluctuations in 
the charge- density n^*^''^ . Hence 

D2= f T'^'" dRdR' (v^'\R,R'-t) (nKn^' + 4t*W),) ■ (6) 
J Jr,r'=i ^ ^ /•ait 

We will assume that £^ri^ {t) averages out to zero. In the presence of network connections 

between agents, there can be strong correlations between these noise terms s^ri^ if) and 

the time-dependence of V^'^HR, R'\ t), implying that the averaging over t should be carried 

out step-by-step as in Ref. jlj]. For MG-like games without connections, the agents cannot 

suddenly access larger numbers of strategies and hence these correlations can be ignored. 

This gives 

D2= f dRdR' (v^^\R,R';t)) urUr, . (7) 

J Jr,r'=i \ 't 

As in conventional many-body theory, the expectation value in Eq. (|7j) can be 'contracted' 
down by makin g us e of the equal-time correlations between {a^*"*^}. As is known for MG- 



l.e ,a.ne. flffi, the complete st.te,v .pace wiU eo.tai. ..ate,es w.eK Kave e.aet, 
the same responses except for a few /i(t)'s. This quasi-redundancy can be removed by 
focusing on a reduced strategy space such that any pair R and R' are either (i) correlated, 
i.e. a^^*^ = a'^P for all (or nearly all) yu(t); (ii) anti-correlated, i.e. a'^^^ = —a'^P for all 
(or nearly all) fi{t); (iii) uncorrelated, i.e. a^*^*^ = a^^,*'' for half (or nearly half) of {yu(t)} 
while a'^*^ = —a'^P for the other half of {fi{t)}. Hence one can choose two subsets of G, 
i.e. Q = U (B U, such that the strategies within U are uncorrelated, the strategies within U 
are uncorrelated, the anticorrelated strategy of R & U appears in U, and the anticorrelated 
strategy of i? G f/ appears in U. We can therefore break up the integrals in Eq. ((Zj) into three 
parts: (i) R' R (i.e. correlated) hence ^ jl^^l"" dixa^^a^^. = 1 and (y^'^\R, R';t))^ = 1. 



(ii) R' R (i.e. anticorrelated) which yields -jj^ //l*™ d,f^(^R(^R' = ~1- If all possible global 
information values {/i} are visited reasonably equally over a long time-period, this implies 
(y^'^^R, R';t)'j^ = —1. For the MG, for example, {jj} corresponds to the m-bit histories 
which indeed are visited equally for small m. For large m, they are not visited equally for a 
given but are when averaged over all \E'. If, by contrast, we happened to be considering 
some general non-MG game where the /x's occur with unequal probabilities p^, even after 
averaging over all one can simply redefine the strategy subsets U and U to yield a 
generalized scalar product, i.e. J^^^l"^ dfia^aR, = —1 (or in case (iii)). (iii) R' -L R 
(i.e. uncorrelated) which yields j;;^ Jl^=r dpa^aR, = and hence (V^'^^R, R';t)'^^ = 0. 
Hence 

D2 = dRdR' {V^^'\R,R';t))^ urUr, = / dR {nRUR - urUr) 

J Jr,r'=i \ 't Jr=i ^ 

= dR {urHr - urUj^ + njfij^ - njfiR) 

JReu 

= I dR{nR-n^f. (8) 

Equation (jH)) must be evaluated together with the condition which guarantees that the total 
number of agents N is conserved: 

fRmax r 

N = dRnR= dR(nR + n-^) . (9) 

Jr=i JReu 

Equation (jHl) has a simple interpretation. Since ur and n-^ have opposite sign, they act like 
two charge-densities of opposite charge which tend to cancel: ur represents a Crowd of like- 
minded people, while n-^ corresponds to a like-minded Anticrowd who do exactly the opposite 
of the Crowd. We have effectively renormalized the charge- densities n^^^''^ and n^P''^ and 
their time- and position-dependent two-body interaction V^'^^{R, R';t) = aR^'^aRp, to give 
two identical Crowd-Anticrowd 'quasiparticles' of charge-density [ur — n-^) which interact 
via a time-independent a,nd position-independent mteiaction term V^g = 1. This is shown 
schematically in Fig. 2. The different types of Crowd-Anticrowd quasiparticle in Eq. (jH)) do 
not interact with each other, i.e. [hr — n-^) does not interact with {uri — n-^) ii R ^ R' . 
Interestingly, this situation could not arise in a conventional physical system containing just 
two types of charge (i.e. positive and negative). 

A given numerical simulation will employ a given strategy-allocation matrix (i.e. a given 
rank- 5* tensor) \E'. As -Rmax increases from 1 ^ oo, tends to become increasingly disordered 
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(i.e. increasingly non-uniform) since the ratio of the standard deviation to the mean 

number of agents holding a particular set of S strategies is equal to [(i?^^^ - l)/iV] 2 . There 
are two regimes: (i) A 'high-density' regime where -Rmax ^ N. Here the charge-densities 
{flu} tend to be large, non-zero values which monotonically decrease with increasing R. 
Hence the set {nn} acts like a smooth function n{R) = {n/?}. (ii) A 'low-density' regime 
where -Rmax 3> A^. Here \l/ becomes sparse with each element \I/r,r',r",... reduced to or 
1. The {ufi} should therefore be written as I's or O's in order to retain the discrete nature 
of the agents, and yet also satisfy Eq. Q 4]. Depending on the particular type of game, 
moving between regimes may or may not produce an observable feature. In the MG, for 
example. does not show an observable feature as fl„„ inereases - however does , 
We leave aside the discussion as to whether this constitutes a true phase-transition p, 
and instead discuss the explicit analytic expressions for D2 which result from Eq. (jH)). It 
is easy to show that the mean number of agents using the Xth most popular strategy (i.e. 
after averaging over \1/) is [J]: 



nx = N 



(10) 



The increasing non-uniformity in \1/ as -Rmax increases, means that the popularity-ranking 
of -R becomes increasingly independent of the popularity-ranking of R. Using Eq. (fT(H) with 
S = 2, and averaging over all possible R positions in Eq. (jH} to reflect the independence of 
the popularity-rankings for R and -R, we obtain: 



Do = Max 



o J-) I max n I p 



(11) 



The 'Max' operation ensures that as -Rmax increases and hence {rifj} ^ 0, 1, Eq. Q is 
still satisfied jj]. Equation (fTT|) underestimates D2 at small -Rmax (see Fig. 1) since it 
assumes that the rankings of -R and -R are unrelated, thereby overestimating the Crowd- 
Anticrowd cancellation. By contrast, an overestimate of D2 at small -Rmax can be obtained 
by considering the opposite limit whereby \1/ is sufficiently uniform that the popularity and 
strategy-performance rankings are identical. Hence the strategy with popularity-ranking X 
in Eq. (fTUI) is anticorrelated to the strategy with popularity-ranking -Rmax + I — X. This 
leads to a slightly modified first expression in Eq. (fTTjl : ^^^^ (1 — -RmL)- Figure 1 shows 

that the resulting analytical expressions reproduce the quantitative trends in the standard 

1/2 

deviation D2 observed numerically for all A^ and -Rmax, o-iT'd they describe the wide spread 
in the numerical data observed at small -Rmax- 



In summary, we have uncovered an explicit connection between multi-agent games and 
conventional many-body theory. This should not only help to bring multi-agent games 
closer to the Physics community, but it should also help the Physics community step into 
non-traditional areas of research where multi-agent simulations are now being actively used. 
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FIG. 1: (Color online) Results for the standard deviation of fluctuations D2 in the Minority 
Game. Numerical results correspond to 20 different runs at each N and i?max- The theoretical 
curves are generated using the analytic expressions in the text. The shaded area bounded by the 
upper and lower curves shows our theoretical prediction of the numerical spread for a given N . In 
line with the original numerical results of Ref. ^ , we have chosen successive i?max tick- values to 
increase by a factor of 4. 
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FIG. 2: Diagrammatic equivalent (schematic) of some simple pth order moments Dp oi the multi- 
agent output variable D{t). Top: p = 1 is the mean Di. Bottom: p = 2 is the variance D2- Also 
shown is an approximate expansion for D2 as represented by Eq. (jH)). Higher-order terms, which are 
neglected in Eq. (jSJ, would correspond to residual correlations between distinct Crowd- Anticrowd 
quasiparticles. 
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